Name | Version | Summary | date |
optimum-benchmark |
0.2.0 |
Optimum-Benchmark is a unified multi-backend utility for benchmarking Transformers, Timm, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes. |
2024-05-16 11:36:29 |
sparseml-nightly |
1.8.0.20240509 |
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models |
2024-05-09 22:43:52 |
grag |
0.0.1 |
A simple package for implementing RAG |
2024-05-09 21:21:22 |
vector-quantize-pytorch |
1.14.22 |
Vector Quantization - Pytorch |
2024-05-09 17:23:56 |
optimum |
1.19.2 |
Optimum Library is an extension of the Hugging Face Transformers library, providing a framework to integrate third-party libraries from Hardware Partners and interface with their specific functionality. |
2024-05-09 11:10:15 |
sparsezoo-nightly |
1.8.0.20240506 |
Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes |
2024-05-06 19:37:51 |
autoawq |
0.2.5 |
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. |
2024-05-02 18:32:41 |
nncf |
2.10.0 |
Neural Networks Compression Framework |
2024-04-25 12:01:53 |
optimum-intel |
1.16.1 |
Optimum Library is an extension of the Hugging Face Transformers library, providing a framework to integrate third-party libraries from Hardware Partners and interface with their specific functionality. |
2024-04-25 08:06:39 |